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ABSTRACT 


MicroRNAs (miRNAs) are one class of short, endogenous RNAs which can regulate gene expression at the post-transcriptional 
level. Previous analysis revealed that mammalian miRNAs tend to cluster on chromosomes. However, the functional 
consequences of this clustering and conservation property are largely unknown. In this study we present a method to identify 
signaling pathways targeted by clustered miRNAs. We performed a computational screen for mouse signaling pathways targeted 
by miRNA clusters. Here, we report that the target genes of 3 miRNA clusters are overrepresented in 15 signaling pathways. We 
provided experimental evidence that one miRNA cluster, mmu-mir-183—96—182 targets Irs1, Rasa1, and Grb2, all of which are 
located in the insulin signaling pathway. Theses results suggest that by targeting components with different roles along a signaling 
pathway, different members of one miRNA cluster can act as a whole to coordinately control the signal transduction process. 
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INTRODUCTION 


MicroRNAs (miRNAs) are a class of ~22-nucleotide (nt) 
endogenous single-stranded RNA which can cleave mRNA 
or repress the initiation of protein translation by imperfect 
base pairing with the targeting sequences on 3'-UTR in 
animals (Bartel 2004). miRNAs are transcribed by RNA 
polymerase II or III as long primary miRNAs (pri-miRNAs) 
which range from hundreds to thousands of nucleotides in 
length. The pri-miRNAs are then processed into mature 
miRNAs by Drosha and Dicer enzymes sequentially. Studies 
have shown that miRNAs play crucial roles in a broad range 
of biological processes including development, differentia- 
tion, tissue morphogenesis, and also in several types of 
diseases such as carcinogenesis and viral infections (Bartel 
2004; Kloosterman and Plasterk 2006). 

Previous analysis has revealed conserved patterns as 
clustering of mammalian miRNAs on chromosomes 
(Altuvia et al. 2005). Megraw et al. (2007) reported that 
in human, mouse, rat, and chicken, even at a very con- 
servative maximum inter-miRNA distance (MID) of 1 kb, 
over 30% of all miRNAs are organized into clusters. If the 


MID extends to 50 kb, over half of all miRNAs are clustered 
on chromosomes. Expression profiling analysis showed that 
proximal pairs of miRNAs are generally coexpressed, and 
an abrupt transition in the correlation between pairs of 
expressed miRNAs occurs at a distance of 50 kb (Baskerville 
and Bartel 2005). According to their chromosomal posi- 
tions, several miRNA clusters have also been experimentally 
confirmed by RT-PCR or Northern blotting (Cai et al. 
2004; Lee et al. 2004; Weston et al. 2006). These evidences 
strongly suggested that members of a particular miRNA 
cluster, which are defined by their chromosomal positions, 
are highly likely to be processed as cotranscribed units. 
However, the functional consequences of this clustering 
and conserved property remain unclear. 

Analogous to transcriptional regulation (between tran- 
scriptional factors and cis-regulatory elements), most miR- 
NAs function to fine tune the expression of hundreds of 
genes in a combinatorial manner (Bartel 2004). Specifically, 
different miRNA species have been proposed to regulate 
a single mRNA target through binding to the same mRNA 
transcript. Several such cases have also been experimen- 
tally verified. For example, Krek et al. (2005) experimen- 
tally validated that myotrophin (Mtpn) as the first 
mammalian gene regulated by miR-375, miR-124, and let- 
7b coordinately. Other studies also showed that miRNAs 
17-5p, 20a, and 106a control monocytopoiesis via binding 
to the 3'-UTR of transcription factor acute myeloid 
leukaemia-1 (AMLI) mRNA (Fontana et al. 2007). 


Dews et al. (2006) recently reported the first evidence of 
a cluster of miRNAs that could regulate functionally related 
proteins in cancer. They found that in Myc overexpressing 
tumors, two anti-angiogenesis genes Tspl and Ctęf are 
downregulated by one Myc activated miRNA cluster miR- 
17-miR-92. Noticeably, they confirmed that within the 
cluster, miR-19 is primarily responsible for Tspl down- 
regulation and miR-18 for Ctgf downregulation in response 
to Myc activation. Inspired by their findings, in this paper, 
we combined the combinatorial regulation concept with 
the clustering property to screen mouse signaling pathways 
which are targeted by clustered miRNAs. Especially, we 
explored another layer of combinatorial regulation, i.e., we 
hypothesize that each member of a miRNA cluster could 
regulate one component within the same signaling path- 
way; thus, the cluster acts coordinately as a whole to 
control a biological process. 


RESULTS 


A computational screen framework for mouse 
signaling pathways targeted by miRNA clusters 


As indicated in Figure 1, to identify mouse signaling 
pathways targeted by miRNA clusters, we initially extracted 
mouse miRNA clusters according to mouse chromosomal 
positions. Although previous studies had indicated that 
defining clustered miRNA memberships according to chro- 
mosomal location is highly consistent with experimental 
evidences as RT-PCR or Northern blotting, we further 
filtered miRNA clusters through (1) an expression profiling 
filter and (2) a regulatory element filter. 
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FIGURE 1. Flowchart of the computational screen framework for 
mouse signaling pathways targeted by the miRNA cluster. The 
algorithm is divided into two stages. The initial stage is to preprocess 
both the mouse miRNA clusters and the Biocarta signaling pathways. 
The second stage involves selecting the significant pathways that are 
enriched with target genes of one miRNA cluster. 


For any particular miRNA cluster that shares the same 
transcriptional unit, the expression profiles of miRNAs 
within this cluster should be similar, and thus positively, 
correlated. We filtered out those miRNA clusters with the 
mean correlation measures that are less than those of the 
background correlation in at least two of the three analyzed 
expression profiles. In addition, we examined for the pres- 
ence of regulatory elements such as transcription start site 
(TSS), CpG window, or predicted first exon in the inter- 
miRNA region of the same cluster. All of these possibly 
indicated a different transcription initiation. We extensively 
searched for the elements in the inter-miRNA region of 
the same cluster, and we filtered out the miRNA clusters 
in which regulatory elements are found. After filtering 
signaling pathways, we selected the significant signaling 
pathways that are enriched with those predicted target 
genes according to the optimized target prediction data. 


Filtering miRNA clusters 


Forty-five miRNA clusters which contained 169 miRNAs, 
were extracted when MID was set to 5 kb. The largest 
miRNA cluster mmu-mir-669a-3 is located on chromo- 
some 2 with 25 miRNA members (Supplemental Table S1). 
Among these 45 miRNA clusters, 40 clusters have a 
correlation value higher than 0.35, the background corre- 
lation value of data set published by Thomson et al. (2004). 
Nineteen out of the 45 clusters have correlation measures 
significantly higher than the background (nominal P < 
0.05) (Supplemental Table S1). Previously, the miRNA 
members of the mir-17 cluster show consistent upregula- 
tion patterns in a Myc-overexpressing Ras-transformed 
colonocytes (Dews et al. 2006). However, the expressions 
of these miRNA members are not correlated in two out of 
three analyzed profiles (Supplemental Table S1), perhaps 
suggesting that this cluster is only regulated conditionally. 
In addition, three other miRNA clusters mmu-mir-212, 
mmu-mir-363, and mmu-mir-382 also do not coexpress in 
various cellular conditions, indicating that the members 
within these clusters have independent regulation mecha- 
nisms (Supplemental Table S1). Nonetheless, most miRNA 
clusters defined by chromosomal locations have coex- 
pressed patterns under different cellular conditions, con- 
sistent with previous observations (Baskerville and Bartel 
2005). 

We also filtered the clusters according to regulatory 
element presence using both FirstEF and Eponine, and 
found regulatory elements within mmu-mir-212—132 clus- 
ter (Supplemental Table S1). Interestingly, the expression 
profiling filter also shows that the expressions of these two 
miRNAs are not correlated in all three analyzed expression 
profiles. Collectively, these evidences strongly suggested 
that these two miRNAs have independent transcription 
mechanisms, and they may not function coordinately to 
regulate signaling pathways. 


After these preprocessing phases, we finally obtained 41 
miRNA clusters, which show coregulation patterns. Similar 
analysis were also conducted when MID was set to 50 kb. In 
all, 55 miRNA clusters were initially selected, and after the 
preprocessing phases, 45 clusters were used in the following 
analysis (Supplemental Table S2). 


Selection of significant pathways that are enriched 
with target genes of one miRNA cluster 


Next, we selected significant pathways that are enriched 
with target genes of one miRNA cluster among the filtered 
97 signaling pathways. As indicated in Table 1, totally 18 
cluster-pathway interactions were found by false discovery 
rate analysis (false discovery rate [FDR] <0.3; nominal P < 
0.03), which involved three miRNA clusters and 15 sig- 
naling pathways. 

Interestingly, one miRNA cluster mmu-mir-183—96—182 
controls 12 signaling pathways. Detailed analysis of the 
target genes by this cluster in these 12 significant pathways 
suggested that miR-96 is the major regulator. Other than a 
few exceptions, miR-96 targets most of the member genes 
in these 12 signaling pathways (Table 2). Previously, the 
pri-miRNA polycistronic organization of this cluster has 
been identified by experimental methods and the coex- 
pression patterns of this cluster were also confirmed 
(Weston et al. 2006; Xu et al. 2007). 

Consistent with our results, this cluster seems to be 
involved in at least three biological functions according to 
the current literature. First, it was reported that one mem- 
ber of this cluster, mir-96, targets protein arginine methyl- 
transferase (PRMT5). Aberrant expression of PRMT5 


leads to altered epigenetic modification of chromatin, 
and consequently impacts on the growth of transformed 
lymphoid cells (Pal et al. 2007). Second, in colorectal 
cancer cell lines, all three members of this cluster show con- 
sistent upregulations. Forkhead family of transcription fac- 
tors including CHES 1, FOXF2, FOXK2, FOXOLlA, FOXO3A, 
and FOXQI, were proposed as putative targets (Bandres 
et al. 2006). Finally, Xu et al. (2007) reported that Mitf, a 
transcription factor required for the establishment and 
maintenance of retinal pigmented epithelium, is a direct 
target of miR-96 and miR-182, indicating that this miRNA 
cluster is also important for retinal development and 
function. We experimentally proved that this cluster is also 
involved in insulin signaling pathway (see below). Therefore, 
these evidences suggested that this cluster plays an important 
role in multiple signaling pathways. 

We also generated a set of experimentally testable 
hypotheses. For example, the epigenetic regulation of 
chromatin domains by miRNAs has recently been demon- 
strated through the finding that miR-26a targets the his- 
tone methyltransferase enhancer of zeste homolog 2 during 
myogenesis (Wong and Tellam 2008). Our prediction 
found that both mmu-mir-206 and mmu-mir-133a-1 
clusters target chromatin remodeling hSWI/SNF ATP- 
dependent complexes, which is a major group of chromatin- 
modifying complexes. It would be interesting to test 
whether these miRNA clusters can regulate the chromatin 
structure via a mechanism involving hSWI/SNF ATP- 
dependent chromatin remodeling complexes. The same 
set of cluster-pathway interactions were obtained when 
the MID set to 50 kb (data not shown), indicating ro- 
bustness of our prediction results. 


TABLE 1. Significant pathways that are enriched with target genes of miRNA clusters (MID <5 kb) 


miRNA cluster Signaling pathway 


Target gene 


mmu-mir-206 
mmu-mir-206 
mmu-mir-206 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-183 
mmu-mir-133a-1 
mmu-mir-133a-1 
mmu-mir-133a-1 


Chromatin remodeling by hSWI/SNF ATP-dependent complexes 
FOSB gene expression and drug abuse 

Sumoylation by RanBP2 regulates transcriptional repression 
Integrin signaling pathway 

Bioactive peptide-induced signaling pathway 

Activation of cAMP-dependent protein kinase, PKA 

G-Protein signaling through tubby proteins 

PTEN-dependent cell cycle arrest and apoptosis 

lon channel and phorbal esters signaling pathway 

Eph Kinases and ephrins support platelet aggregation 

Insulin signaling pathway 

IGF-1 signaling pathway 

Role of Erk5 in neuronal survival 

Links between Pyk2 and map kinases 

Rac1 cell motility signaling pathway 

Chromatin remodeling by hSWI/SNF ATP-dependent complexes 
FOSB gene expression and drug abuse 

Sumoylation by RanBP2 regulates transcriptional repression 


Smarca4, Smarcb1, Tbp, Smarcd1 
Fosb Jund 

Ranbp2, Sumo1, Hdac4 

Cav1, Crkl, Fyn, Grb2, Itgb1, Shc1 
Fyn, Gnb1, Grb2, Prkca, Plcg1, Shc1 
Gnb1, Prkarla 

Gnb1, Htr2c, Tub 

Grb2, Itgb1, Shc1, Foxo3a 

Prkca, Plcg1 

Epha4, Fyn, Itgb1, L1cam 

Grb2, Irs1, Shc1, Rasal 

Grb2, Irs1, Shc1, Aasa 7 

Creb1, Grb2, Plcg1, Shcl 

Crkl, Grb2, Pak1, Plcgl, Rac1, Shc1, Map2k3 
Cfl1, Pak1, Pdgfra, Pld1, Rac7 
Smarca4, Smarcb1, Tbp 

Fosb Jund 

Ranbp2, Sumo1, Hdac4 


A FDR <0.3(nominal P < 0.03) was considered significant. 


TABLE 2. Target genes by mmu-mir-183—96-182 in the 12 signifi- 
cant pathways 


Target gene miR-96 miR-182 miR-183 


Cav1 X 

C77 X 

C7e り 7 X X 
Crkl X 

Epha4 X 
Foxo3a X 
Fyn X 
Gnb1 X 
Grb2 X X 
Htr2c X X 
Irs X 

Itgb1 X 
L1cam X 
Map2k3 X 
Pak1 X 
Pdgfra X 
Plcg1 X 
Pld1 X 
Prkarla X 
Prkca X 
Racl X 
Rasal X 
Shel X 
Tub X 


x Kx x XX 


x X 


Target genes set include all genes in the 12 selected significant 
pathways targeted by at least one member of the mir-183—96-182 
cluster. “X” means a hit by the intersection of PicTar and 
TargetScanS predictions. 


Experimental verification of one miRNA cluster 
targeting the insulin signaling pathway 


Insulin signaling pathway is involved in both metabolic and 
mitogenic aspects of cellular function. In our prediction, 
one miRNA cluster mmu-mir-183-96-182 targets four 
members of the insulin signaling pathway. In the intersec- 
tion of PicTar and TargetScanS prediction, both mir-96 and 
mir-183 target insulin receptor substrate 1 (irsl), whereas 
mir-96 and mir-182 target both ras p21 protein activator 1 
(rasal) and growth factor receptor bound protein 2 (grb2). 

We set up luciferase reporter assays to validate these 
predicted targets. Fragments of the 3'-UTR of irsl, rasal, 
and grb2, including the miRNA target sites, were individ- 
ually cloned into a firefly luciferase expression vector pGL3. 
Cells were respectively transfected with these reporters 
together with a renilla luciferase vector pRL-TK as an 
internal control. While using mock transfected cells as a 
control (no oligos), we examined the effects of over- 
expressing different miRNA oligos on the expressions of 
these reporters. First, we found that a nonspecific oligo 
(NO), even at a relatively high concentration, did not alter 
the expression of all three reporters (Fig. 2A-C). Second, a 
mir-153 oligo which is predicted to target irs2, did not sig- 
nificantly alter the gene expression of a reporter containing 
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FIGURE 2. Repression of irsl, rasal, and grb2 3'-UTR by mmu-mir- 
183—96-182 cluster. 3'-UTR of target genes were inserted into the 
3’ end of the luciferase gene in the pGL3 control vector. Cells were 
transfected with or without 10 nM miRNA oligos indicated, pGL3 
vectors containing the 3'-UTR of target genes, and an internal control 
Renilla luciferase vector. The luciferase activity levels were determined 
after 24 h. Data were generated from triplicate in at least three 
independent experiments. The ratios of Firefly luciferase to Renilla 
luciferase expression were normalized to no oligos transfection. **P < 
0.01 for no oligos versus various oligos treatments, NS: no significant 
difference. (A) pGL-3-Irsl plasmid cotransfected with miR-153, miR- 
183, miR-96, miR-182, and nonspecific oligos (NC). NC, miR-153, 
and miR-182 were employed as negative controls as predicted by both 
Pictar and TargetScanS algorithms. (B) pGL-3-Rasal plasmid cotrans- 
fected with miR-183, miR-96, miR-182, and nonspecific oligos (NC). 
Both NC and mir-183 were employed as negative controls as predicted 
by both Pictar and TargetScanS algorithms. (C) pGL-3-Grb2 plasmid 
cotransfected with miR-183, miR-96, miR-182, and nonspecific oligos 
(NC). Both NC and mir-183 were employed as negative controls as 
predicted by both Pictar and TargetScanS algorithms. A dotted 
rectangle indicates the miRNA cluster mmu-mir-183—96—182. 


the 3’-UTR of irsl (Fig. 2A). Third, we experimentally 
confirmed that both mir-96 and mir-183 repressed the 
gene expression of this reporter about 40% via targeting 
the 3’-UTR of irsl (Fig. 2A). The 3'-UTR of irs] does not 
contain a targeting sequence for another member of this 
cluster mir-182. We did not observe any significant effects 
of mir-182 oligo on the expression of this reporter, 
demonstrating the specificity and validity of our reporter 
system (Fig. 2A). 

We also observed strong repression of a luciferase 
reporter through presumed binding of mir-182 to the 3'- 
UTR of rasal (Fig. 2B). Moderate but significant repression 
of a luciferase reporter was also observed in pGL-3-grb2 
plasmid cotransfected with mir-96 or mir-182 oligos (Fig. 
20). Although rasal was also a predicted target of mir-96 
by optimized prediction method, there was no significant 
difference between cotransfections with or without mir-96 
oligos, which possibly indicated that this was a false 
positive prediction. 


DISCUSSION 


One principle of gene regulation by miRNA is coordinate 
gene expression. In this model, individual miRNA may 
reduce the expression of target genes moderately while a 
collection of miRNAs may act in a combinatorial fashion 
to exert significant effects (Bartel 2004; Stark et al. 2005). 
Bioinformatics analysis agrees with this principle as well, 
since it is common to find multiple and independent 
binding sites on most, if not all, target genes. Different 
groups had also reported in experimental instances that 
miRNAs can act cooperatively through multiple target sites 
on the same gene. Recent studies have widened the scope of 
this combinatorial regulation such that a few miRNAs can 
regulate a group of functionally related genes instead of a 
single gene (Hua et al. 2006). These are convenient and 
efficient strategies to regulate cellular processes, but the key 
issue is that these regulatory miRNAs should themselves be 
regulated coordinately. Therefore, certain mechanisms 
must exist to guarantee the coordinate expression of these 
miRNAs. 

Combining the coregulation concept with the observation 
that mammalian miRNAs tend to cluster on chromosomes, 
we proposed that the clustering and conserved patterns 
provide a natural mechanism to explain the principle of 
coregulation by miRNAs. Since those clustered miRNAs are 
largely regulated under the same control, upon stimulation, 
members within the same cluster could express and function 
in a cooperative manner. Thus, the downstream effect, for 
example, to completely shut down a biological process, is 
ensured. However, the above regulation model does not rule 
out the possibility that a single miRNA could greatly alter 
a cellular process. For instance, a miRNA may target an 
essential regulator of a signaling pathway which leads to 
alterations in multiple downstream effectors. 


From the discovery of miRNAs, plenty of evidences have 
demonstrated the increasingly important role of these tiny 
molecules in developmental processes and carcinogenesis. 
However, relatively few are known for their regulation on 
cellular metabolic processes. Here, we provided the first 
evidence of an interplay between one miRNAs cluster 
mmu-mir-96—182—183 and the insulin signaling pathway. 

The IRS proteins are major players for the metabolic 
aspect of the insulin signaling cascade because they mediate 
the activations of intracellular signaling molecules, such as 
PI 3-kinase (PI3k), Akt, and molecular target of rapamycin 
(mTOR). On the other hand, the proliferative and mito- 
genic effects of insulin are mediated largely via the 
activation of Ras signaling pathway (Draznin 2006). Grb2 
is a classical adapter molecule that controls Ras and MAPK 
activations, which are important for growth factor signal- 
ing. Therefore, Grb2 is a potential target of antitumor 
therapy (Dharmawardana et al. 2006). Rasal, which trans- 
forms active GTP-bound Ras into inactive GDP-bound 
form, links the insulin signaling with Ras signaling path- 
way. Loss of Rasal could lead to increased activation of 
Ras, which has been linked to various tumors (Downward 
2003). 

Importantly, we showed that three members of mmu- 
mir-96-182—183 have distinct but coordinate roles in 
controlling insulin signaling. Mir-96 and mir-183 are 
mainly responsible for regulating Irs], which is the key 
mediator of metabolic aspect of insulin pathway, whereas 
mir-182 targets Rasal and Grb2, both of which play 
important roles in linking insulin signaling with the Ras 
pathway. Thus we proved that by targeting components 
with different roles along the insulin signaling pathway, 
mmu-mir-183—96-182 can coordinately control the insulin 
signal transduction. This regulation model is illustrated in 
Figure 3. Due to the potential bias of a reporter assay, 
further in vivo experiments are required to confirm the 
regulation and to establish the physiological consequences. 

Recently, several groups reported about the global anal- 
ysis of predicted miRNA targets with sets of genes (i.e., 
pathway or gene ontology [GO] term). Gusev et al. (2007) 
reported GO terms as well as biological functions targeted 
by five coexpressed miRNAs sets. Cui et al. (2006) analyzed 
distribution of the predicted miRNA targets in the cellular 
signaling network. But no one explicitly explored the 
functional consequence of clustered miRNAs with signaling 
pathways. Our approach can be extended forward to GO 
and other pathway system (i.e., KEGG) analysis as well as 
human pathway analysis. It is also important to identify 
signaling pathways that are conditionally and tissue specif- 
ically regulated by miRNA clusters. This would require 
further fine tuning the mining methods and obtaining 
additional miRNA expression profiles under specific con- 
ditions. However, this direction of research is beyond 
the scope of our current study. Such systems biological 
approaches will provide insight into the principles of the 
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FIGURE 3. Model of the regulation of the insulin signaling pathway 
by miRNA cluster mmu—mir-183—96—182. Mir-96 and mir-183 are 
mainly responsible for regulating Irs1, which is the key mediator of 
the metabolic aspect of the insulin pathway, whereas mir-182 targets 
Rasal and Grb2, both of which play important roles in linking insulin 
signaling with the Ras pathway. A dotted arrow indicates the 
experimentally verified miRNA and target gene interactions via 
luciferase activity assay. 


regulation of signaling networks by miRNAs and facilitate 
experimentally identifying interactions between miRNAs 
and target mRNAs. 


MATERIALS AND METHODS 


Data sources 
Mouse miRNA clusters data 


According to chromosome positional relationships between miRNAs, 
any two miRNAs with maximum distance less than defined 
distance were considered to be in the same cluster. We derived 
mouse miRNAs clusters from miRGen database (Megraw et al. 
2007). Forty-five miRNA clusters which contains 169 miRNAs 
were extracted when MID was set to 5 kb. Fifty-five miRNA 
clusters which contains 202 miRNAs were found when MID was 
set to 50 kb. 


Optimized targets prediction data 


Sethupathy et al. (2006) compared various miRNA target-pre- 
diction programs both individually and in combination with 
other programs. They suggested that the intersection of PicTar 
and TargetScanS prediction could achieve both high sensitivity 
and specificity compared to other programs or program combi- 
nations. This optimized target prediction data was used in this 
study. It includes 21,225 miRNA-target gene pairs, which totally 
involved 112 miRNAs and 3586 target genes. 


Mouse Biocarta pathway data 


We use the compiled pathway data by Tian et al. (2005). It is 
consisted of 277 Mouse Biocarta pathway data, and could be 


miRNA expression data 


We used three mouse miRNA expression profile data sets. 
Thomson data: the investigators used a dual-channel microarray 
to monitor expression levels of 124 mouse miRNAs in seven 
mouse tissues and four different embryonic stages, embryonic 
stem cells, and embryoid bodies (Thomson et al. 2004). Beuvink 
data: chemically modified 2’-O-(2-methoxyethyl)-(MOE) oligo- 
ribonucleotide probes were arrayed onto Evanescent Resonance 
microchips to detect 200 miRNAs expression in eight mouse 
tissues (Beuvink et al. 2007). Betel data: miRNA expression 
profiles derived from a comprehensive cloning and sequencing 
effort of 64 mouse small RNA libraries extracted from major 
organs and cell types. Expression values were reported as normal- 
ized clone counts for 400 miRNAs (Betel et al. 2008). 


Computational framework 
miRNA profiling filter 


We computed the Pearson coefficient for each pair of miRNAs 
within the same cluster. The mean correlation coefficients were 
used as a correlation measure for a miRNA cluster. For each 
miRNA profile, we also randomly sampled 1000 pairs of miRNAs 
and computed the pearson coefficient. The average value of 
positive correlation measures was used as a background correla- 
tion. Finally, we filtered out such clusters that the mean correla- 
tion measure is less than those of the background correlation in at 
least two of the three miRNA profiles. If there are <2 miRNAs 
within the same cluster measured in the three expression profiles, 
the miRNA cluster passes the filter automatically as currently 
there are not enough evidences to show this cluster does not 
coexpressed. To assess the statistical significance of the correlation 
measure for each miRNA cluster, the fraction of 1000 random 
sampling pairs of miRNAs having equal to or larger than the 
correlation measure was reported as nominal P value. 


Regulatory elements filter 


We used FirstEF and Eponin programs to search for possible 
regulatory elements among the inter-miRNA region within the 
same cluster. FirstEF is a decision tree based promoter detection 
programs to predict exon, promoter, and CpG window of se- 
quences (Davuluri et al. 2001). The Eponine program provides a 
probabilistic method for detecting transcription start sites (TSS) 
in the mammalian genomic sequence (Down and Hubbard 2002). 
For the analysis, we used the UCSC human multiz28way conser- 
vation track to get the corresponding mouse regions. All the 
sequence analyses were done at Galaxy platform (Giardine et al. 
2005; Karolchik et al. 2008). We filtered out such clusters that the 
regulatory elements were found by any one of the two detection 
programs. miRNA cluster that passes both miRNA profiling filter 
and regulatory elements filter were used in the following analysis. 


Filtering pathways 


For the 277 Biocarta signaling pathways, there are some pathways 
that only a few or even none of its component genes are predicted 


as miRNA targets in the optimized target prediction data. Only 
those signaling pathways with at least 50% of their component 
genes listed in the optimized target prediction data as target genes 
were selected for further analysis. After this selection, there 
remained 97 signaling pathways. 


Finding significant interactions between clusters 
and pathways 


For each cluster and each signaling pathway, we first found 
predicted target genes of all the miRNA members within this 
cluster. Then, we calculated the number of irredundant over- 
lapping target genes with the component genes on the signaling 
pathway. One thousand simulated miRNA clusters of the same 
size were also randomly sampled from the 112 miRNAs with 
target prediction and then the corresponding number of over- 
lapping targets was recorded. We set the P value of the cluster— 
pathway pair as the fraction of 1000 random sampling having 
equal to or larger than the number of overlapping targets of the 
real miRNA cluster. The P value based on such randomizations 
was used to assess whether the observed number of overlapping 
genes for a cluster-pathway pair was achieved by chance. Multiple 
statistical tests were controlled by false discovery rate (FDR) 
(Benjamini and Hochberg 1995). After applying the same pro- 
cedure to all the cluster-pathway pairs, we selected the cluster— 
pathway pair at FDR <0.3 (nominal P < 0.03). 


Experimental verification 


The siRNA duplexes mir-96, mir-153, mir-182, and mir-183 were 
purchased from Shanghai Gene-Pharma Co. siRNA duplexes with 
nonspecific sequences were used as a negative control (NC). 
Mouse genes 3'-UTR sequences were downloaded from Pictar 
website (http://pictar.bio.nyu.edu/cgi-bin/PicTar_vertebrate.cgi). 
The fragments of irs] (mm_NM_005544.0) 3’-UTR (nucleotides 
33-1001), grb2 (mm_NM_203506.0) 3’-UTR (nucleotides 82- 
598), and rasal (NM_002890.0) 3'-UTR (nucleotides 9-936) were 
PCR-amplified and subcloned into the XbaI site in pGL3-Control 
vector (Promega Corp.). Constructs were all sequence verified 
(Guangzhou Invitrogen Biotechnology Co., Ltd.). In addition to 
nonspecific sequences as a negative control, specific negative 
controls were employed (both mir-153 and mir-182 for irs1, and 
mir-183 for rasal, and grb2) because no target sites were pre- 
dicted for the corresponding miRNA oligos by both Pictar and 
TargetScanS algorithms. 

For luciferase activity assay, HeLa cells were seeded 24 h prior to 
transfection in 96-well tissue culture plates. The next day, cells were 
cotransfected with various constructs and internal control pRL-TK 
and 10 nM of each miRNA duplex using Lipofectamine 2000 in 
accordance with the manufacturer’s instructions. Cell lysate was 
collected 24 h after transfection. Firefly and Renilla luciferase 
activities were measured using a Dual Luciferase Reporter Assay 
System (Promega Corp.). All of the assays were performed in 
triplicate in at least three independent experiments. We assessed 
statistical significance using two-tailed Student's t-test. 
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